Adaptive constraint satisfaction for Markov decision process congestion games: Application to transportation networks
نویسندگان
چکیده
Under the Markov decision process (MDP) congestion game framework, we study problem of enforcing population distribution constraints on a players with stochastic dynamics and coupled costs. Existing research demonstrates that players’ can be satisfied by tolls. However, computing minimum toll value for constraint satisfaction requires accurate modeling player’s Motivated settings where an cost model may unavailable (e.g. transportation networks), consider MDP unknown We assume constraint-enforcing authority repeatedly enforce tolls converges to ϵ-optimal any given toll. then construct myopic update algorithm compute while ensuring are average. analyze how sub-optimal responses impact rates convergence towards satisfaction. Finally, Uber drivers in Manhattan, New York City (NYC) using data from Taxi Limousine Commission (TLC) illustrate efficiently reduce minimizing driver earnings.
منابع مشابه
Learning Chordal Markov Networks by Constraint Satisfaction
We investigate the problem of learning the structure of a Markov network from data. It is shown that the structure of such networks can be described in terms of constraints which enables the use of existing solver technology with optimization capabilities to compute optimal networks starting from initial scores computed from the data. To achieve efficient encodings, we develop a novel character...
متن کاملUnordered Constraint Satisfaction Games
We consider two-player constraint satisfaction games on systems of Boolean constraints, in which the players take turns in selecting one of the available variables and setting it to true or false, with the goal of maximising (for Player I) or minimising (for Player II) the number of satisfied constraints. Unlike in standard QBF-type variable assignment games, we impose no order in which the var...
متن کاملAdaptive Constraint Satisfaction
Many different approaches have been applied to constraint satisfaction. These range from complete backtracking algorithms to sophisticated distributed configurations. However, most research effort in the field of constraint satisfaction algorithms has concentrated on the use of a single algorithm for solving all problems. At the same time, a consensus appears to have developed to the effect tha...
متن کاملGoal Recognition with Markov Logic Networks for Player-Adaptive Games
Goal recognition in digital games involves inferring players’ goals from observed sequences of low-level player actions. Goal recognition models support player-adaptive digital games, which dynamically augment game events in response to player choices for a range of applications, including entertainment, training, and education. However, digital games pose significant challenges for goal recogn...
متن کاملPartially Observable Markov Decision Process Approximations for Adaptive Sensing
Adaptive sensing involves actively managing sensor resources to achieve a sensing task, such as object detection, classification, and tracking, and represents a promising direction for new applications of discrete event system methods. We describe an approach to adaptive sensing based on approximately solving a partially observable Markov decision process (POMDP) formulation of the problem. Suc...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Automatica
سال: 2023
ISSN: ['1873-2836', '0005-1098']
DOI: https://doi.org/10.1016/j.automatica.2023.110879